Претрага
27 items
-
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection
Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022)In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ...Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
A bilingual digital library for academic and entrepreneurial knowledge management
A generic knowledge management process of organization, storage and retrieval of knowledge can suitably be fitted in a digital library. In the digital and knowledge age digital libraries can be used in knowledge management to handle intellectual assets and support knowledge creation. A multilingual digital library either stores content in more than one language or provides multilingual query access to monolingual content. In Serbia 18 of 308 scientific journals regularly published are bi-lingual, with papers simultaneously being in English ...... domains of library and information sciences, digital humanities, architecture and mining and also project reports on academic and entrepreneurial knowledge and e-learning. It contains articles from three journals INFOtheca - Journal for Digital Humanities, Underground Mining Engineering, and Ar ...
... is PhD candidate at Faculty of Philology, University of Belgrade. Her scientific work is primarily oriented toHuman Language Technologies, Digital Humanities and e-Learning. Dalibor Vorkapić is research assistant and system administrator at Faculty of Mining and Geology at University of Belgrade ...
... and retrieval of knowledge can suitably be fitted in a digital library. In the digital and knowledge age digital libraries can be used in knowledge management to handle intellectual assets and support knowledge creation. A multilingual digital library either stores content in more than one language ...Ranka Stanković, Cvetana Krstev, Biljana Lazić, Dalibor Vorkapić. "A bilingual digital library for academic and entrepreneurial knowledge management" in Proceeding of 10th International Forum on Knowledge Asset Dynamics — IFKAD 2015: Culture, Innovation and Entrepreneurship: connecting the knowledge dots, Bari, Italy, 10-12 June 2015, Bari : IFKAD (2015)
-
Infotheca (Q25460443) in Wikidata
Ranka Stanković, Lazar Davidović (2021)Vikipodaci su baza znanja Zadužbine Vikimedija koja predstavlja zajednički izvor različitih vrsta podataka koje koriste ne samo drugi Vikipedijini projekti, već sve više i brojne aplikacije semantičkog veba. U ovom radu ćemo prezentovati primer integracije Vikipodataka sa digitalnim bibliotekama i eksternim sistemima, kao i mogućnost ubrzanja pripreme i unosa podataka na primeru radova iz časopisa za digitalnu humanistiku Infoteka.... present an example of integra- tion of Wikidata with digital libraries and ex- ternal systems, as well as the potential for speeding up the process of data preparation and entry using the articles published in In- fotheca, Journal for Digital Humanities as an example. KEYWORDS: Semantic Web, Open Linked ...
... present an example of integration of Wikidata with digital libraries and external systems, as well as the potential for speeding up the process of data preparation and entry using the articles published in In- fotheca, Journal for Digital Humanities as an example. Semantic web is an extension of the ...
... already exists and is stored in different digital formats. With proper prior preparation, it can be entered in Wikidata semiautomatically. Therefore, the basic idea was to speed up the entry of data about the results of the research in the domain of digital humanities in Serbia, as well as about old Serbian ...Ranka Stanković, Lazar Davidović. "Infotheca (Q25460443) in Wikidata" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.5
-
The Use of the Omeka Semantic Platform for the Development of the University of Belgrade, Faculty of Mining and Geology Digital Repository
Under the regulations of the Ministry of Education, Science and technological Development, a digital repository based on the Omeka S data storage platform has been developed for the Faculty of Mining and Geology. The platform has been upgraded with the required modular extensions, Solr index and automatic OCR. Furthermore, document indexing and search have been fine-tuned with the aid of e-dictionaries of the Serbian language, which has brought about outstanding results in terms of usage facilitation and overall ...Petar Popović, Mihailo Škorić, Biljana Rujević. "The Use of the Omeka Semantic Platform for the Development of the University of Belgrade, Faculty of Mining and Geology Digital Repository" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2020.20.1_2.9
-
A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals
This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of TMX documents generated from aligned parallel articles residing in multilingual digital libraries of e-journals. The queries initiated by a simple or multiword keyword, in Serbian or English, can be expanded by Bibliša, both semantically and morphologically, using different supporting monolingual and multilingual resources, such as wordnets and electronic dictionaries. The tool operates within a complex system composed ...... Search of Multilingual Digital Libraries of E-journals Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals ...
... ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs A tool for enhanced search of multilingual digital libraries ...
... possibilities of enhancing queries submitted to large collections of TMX documents generated from aligned parallel articles residing in multilingual digital libraries of e-journals. The queries initiated by a simple or multiword keyword, in Serbian or English, can be expanded by Bibliša, both semantically ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić. "A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals" in Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, May 2012, Istanbul, Turkey, Istanbul, Turkey : European Language Resources Association (2012)
-
Keyword-Based Search on Bilingual Digital Libraries
This paper outlines the main features of Biblisha, a tool that offers various possibilities of enhancing queries submitted to large collections of aligned parallel text residing in bilingual digital library. Biblishsa supports keyword queries as an intuitive way of specifying information needs. The keyword queries initiated, in Serbian or English, can be expanded, both semantically, morphologically and in other language, using different supporting monolingual and bilingual resources. Terminological and lexical resources are of various types, such as wordnets, electronic ...Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović. "Keyword-Based Search on Bilingual Digital Libraries" in Semantic Keyword-Based Search on Structured Data Sources - Second COST Action IC1302 International KEYSTONE Conference, IKC 2016, Springer (2017). https://doi.org/10.1007/978-3-319-53640-8_10
-
Knowledge and Rule-Based Diacritic Restoration in Serbian
In this paper we present a procedure for the restoration of diacritics in Serbian texts written using the degraded Latin alphabet. The procedure relies on the comprehensive lexical resources for Serbian: the morphological electronic dictionaries, the Corpus of Contemporary Serbian and local grammars. Dictionaries are used to identify possible candidates for the restoration, while the dataobtainedfromSrpKorandlocalgrammarsassistsinmakingadecisionbetween several candidates in cases of ambiguity. The evaluation results reveal that,dependingonthetext,accuracyrangesfrom95.03%to99.36%,whilethe precision (average 98.93%) is always higher than the recall (average 94.94%).... where the expert approach and the crowdsourcing will be combined. For Russian, traditional information-retrieval thesauri in social sciences and the humanities have been developed and are supported in the Institute of Scientific Information of Russian Academy of Sci- ences (INION RAN). This institution ...
... specialized linguistic ontologies. Proceedings of Ontolex 2002, pages 43–48. Mdivani, R. (2013). Thesauri of the isiss ras for social sciences and humanities. Scientific and Technical Informa- tion Processing, 40(3):137–141. Proceedings of CLIB 2018 102 Miller, G. A., Beckwith, R., Fellbaum, C., Gross ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...Cvetana Krstev, Ranka Stanković, Duško Vitas. "Knowledge and Rule-Based Diacritic Restoration in Serbian" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018): 41-51
-
Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources
Large collections of textual documents represent an example of big data that requires the solution of three basic problems: the representation of documents, the representation of information needs and the matching of the two representations. This paper outlines the introduction of document indexing as a possible solution to document representation. Documents within a large textual database developed for geological projects in the Republic of Serbia for many years were indexed using methods developed within digital humanities: bag-of-words and named ...... large textual database developed for geological projects in the Republic of Serbia for many years were indexed using methods developed within digital humanities: bag-of-words and named entity recognition. Documents in this geological database are described by a summary report, and other data, such as ...
... large textual database developed for geolog- ical projects in the Republic of Serbia for many years were indexed using methods developed within digital humanities: bag-of-words and named entity recognition. Documents in this geological database are described AQ2 by a summary report, and other data, such ...
... extracting and selecting specific terms (words) from the document text. Language processing methods and techniques devel- oped within the field of digital humanities are used for completing this task. They provide for determining the boundaries of sentences within the document text, tokenization, stemming ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources" in Trans. Computational Collective Intelligence - Lecture Notes in Computer Science 26, Springer (2017). https://doi.org/10.1007/978-3-319-59268-8_8
-
Managing mining project documentation using human language technology
Purpose: This paper aims to develop a system, which would enable efficient management and exploitation of documentation in electronic form, related to mining projects, with information retrieval and information extraction (IE) features, using various language resources and natural language processing. Design/methodology/approach: The system is designed to integrate textual, lexical, semantic and terminological resources, enabling advanced document search and extraction of information. These resources are integrated with a set of Web services and applications, for different user profiles and use-cases. Findings: The ...Digital libraries, Information retrieval, Data mining, Human language technologies, Project documentationAleksandra Tomašević, Ranka Stanković, Miloš Utvić, Ivan Obradović, Božo Kolonja . "Managing mining project documentation using human language technology" in The Electronic Library (2018). https://doi.org/10.1108/EL-11-2017-0239
-
Serbian NER&Beyond: The Archaic and the Modern Intertwinned
U ovom radu predstavljamo srpski književni korpus koji se razvija pod okriljem COST Akcije „Distant Reading for European Literary History” CA16204. Koristeći ovaj korpus romana napisanih pre više od jednog veka, razvili smo i učinili javno dostupnim Sistem za prepoznavanje imenovanih entiteta (NER) obučen da prepozna 7 različitih tipova imenovanih entiteta, sa konvolucionom neuronskom mrežom (CNN), koja ima F1 rezultat od ≈91% na test skupu podataka. Ovaj model je dalje ocenjen na posebnom skupu podataka za evaluaciju. Završavamo poređenje ...... first Serbian Literature Cor- pus of the Late 19th and Early 20th century with the TXM platform. In DH_BUDAPEST_2019, pages 36–37. Centre for Digital Humanities - Eötvös Loránd Univer- sity. http://elte-dh.hu/wp-content/uploads/ 2019/09/DH_BP_2019-Abstract-Booklet.pdf. Cvetana Krstev, Ivan Obradović ...
... Computation, 24(2):473–489. Cvetana Krstev and Ranka Stanković. 2020. Old or New, we Repair, Adjust and Alter (Te- xts). Infotheca - Journal for Digital Humanities, 19(2):61–80. Denis Maurel, Nathalie Friburger, and Iris Eshkol- Taravella. 2014. Enrichment of Renaissance Te- xts with Proper Names. INFOtheca: ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...Branislava Šandrih Todorović, Cvetana Krstev, Ranka Stanković, Milica Ikonić Nešić. "Serbian NER&Beyond: The Archaic and the Modern Intertwinned" in Proceedings of the Conference Recent Advances in Natural Language Processing - Deep Learning for Natural Language Processing Methods and Applications, INCOMA Ltd. Shoumen, BULGARIA (2021). https://doi.org/10.26615/978-954-452-072-4_141
-
Preparation of Multimedia Document “YU Rock Scene”
SUMMARY: This study will present the preparation process of a multimedia document entitled YU ROCK SCENE in which participants were senior students of undergraduate studies of the Department of Library and Information Science at the University of Belgrade Faculty of Philology during the academic year 2014/2015, as a part of the subject Multimedia Documents. This study gives an overview of the historical development of rock and roll in the territory of the former Yugoslavia, rock scene in Yugoslav republics, ...... РГФ] Preparation of Multimedia Document “YU Rock Scene” | Milena Obradović, Aleksandra Arsenijević, Mihailo Škorić | Infotheca - Journal for Digital Humanities | 2017 | | 10.18485/infotheca.2016.16.1_2.6 http://dr.rgf.bg.ac.rs/s/repo/item/0004950 Дигитални репозиторијум Рударско-геолошког факултета ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...Milena Obradović, Aleksandra Arsenijević, Mihailo Škorić. "Preparation of Multimedia Document “YU Rock Scene”" in Infotheca - Journal for Digital Humanities, Faculty of Philology, University of Belgrade (2017). https://doi.org/10.18485/infotheca.2016.16.1_2.6
-
EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School
Prva škola za obuku polaznika koju je organizovala COST akcija NexusLinguarum održana je od 8. do 12. februara 2021. godine sa ciljem da studenti, istraživači i stručnjaci nauče osnove lingvističke nauke o podacima. Tokom obuke polaznici su se upoznali sa širokim spektrom tema: od semantičkog veba, RDF -a i ontologija, do modeliranja i pretraživanja jezičkih podataka pomoću najsavremenijih ontoloških modela i alata. Škola je održana u okviru serije letnjih škola EUROLAN-a i organizovalo ju je virtuelno (onlajn) nekoliko instituta; ...nauka o lingvističkim podacima, povezani podaci u lingvistici, jezički podaci, EUROLAN, NexusLinguarum, COST akcija, škola za obuku... with a survey form to gather feedback on both organizational and academic aspects of the school. The results have shown that the disciplines of the humanities/linguistics/lexicography had a higher representation among participants than computer science, and that the school was well-focused, well-balanced ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković. "EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.7
-
SrpELTeC on Platforms: Udaljeno čitanje, Aurora, NoSketch
Serbian ELTeC collection (100 novels and extended) developed within COST action CA16204 Distant Reading for European Literary History comprises at this moment 111 novels published in the period 1840-1920. Such a valuable resource is and will be used for various lexical and linguistic research, by using different tools and methodologies. In this paper, three platforms on which these novels are published will be presented: “Udaljeno ˇcitanje”, Aurora and Sketch Engine.Ranka Stanković, Mihailo Škorić, Petar Popović. "SrpELTeC on Platforms: Udaljeno čitanje, Aurora, NoSketch" in Infotheca, Faculty of Philology, University of Belgrade (2022). https://doi.org/10.18485/infotheca.2021.21.2.7
-
Primena digitalne fotogrametrije u rudniku Rudnik
Digitalna fotogrametrija, lasersko skeniranje i virtuelizacija sve više pronalaze aktivne uloge u rudarstvu, prvenstveno zbog svoje široke mogućnosti primene. Laserskim skeniranjem i fotogrametrijom snimaju se i kreiraju 3D modeli otkopa, jamskih hodnika, površinskih kopova, površinskih objekata (zgrada, spomenika, predmeta), a vrši se skeniranje celih rudnika koji se rekonstruišu u 3D digitalne modele. Fotogrametrija predstavlja proces fotografisanja objekata pomoću fotoaparata i daljom obradom fotografija u različitim softverskim alatima kreiraju se realne kopije 3D modela objekata ili predmeta u digitalnoj formi. Za ...... are scanned and reconstructed into 3D digital models. Photography is the process of photographing objects using a camera and by further processing the photos in various software tools, realistic copies of 3D models of objects or tools are created in digital form. For the development of high-quality ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...
... dostupan široj Javnosti. Ključne reči: digitalna fotogrametrija, prostorna vizuelizacija, rudarski objekti, podzemna eksploatacija Abstract Digital photogrammetry, laser scanning and virtualization are increasingly finding active roles in mining, primarily due to their wide application possibilities ...Nikola Mirković, Luka Crnogorac, Katarina Urošević. "Primena digitalne fotogrametrije u rudniku Rudnik" in XI Međunarodna konferencija ugalj i kritični minerali CCM 2023, Zlatibor, 11-14. oktobar 2023., Beograd : Jugoslovenski komitet za površinsku eksploataciju (2023)
-
Old or New, We Repair, Adjust and Alter (Texts)
Cvetana Krstev, Ranka Stanković (2020)U ovom radu predstavljamo kako se e-rečnici i kaskade transduktora konačnih stanja implementirani u alatu Unitex mogu koristiti za rešavanje tri problema transformacije teksta: ispravljanje tekstova nakon OCR-a, vraćanje dijakritičkih znakova i prebacivanje između različitih jezičkih varijanti.ispravka teksta, OCR greške, restauracija dijakritika , jezičke varijante, elektronski rečnik, transduktori konačnih stanja... (1992): 377–439 Lazić, Biljana and Mihailo Škorić. “From DELA based Dictionary to Lex- imirka Lexical DataBase”. Infotheca – Journal for Digital Humanities Vol. 19, no. 2 (2019): 00–00, https://infoteka.bg.ac.rs/ojs/index. php/Infoteka/article/view/2019.19.2.4_en Miller, George A and Elizabeth A ...
... 1145/359038.359041 Petković, Ljudmila. “Creation and Analysis of the Yugoslav Rock Song Lyrics Corpus from 1967 to 2003”. Infotheca – Journal for Digital Humanities Vol. 19, no. 1 (2019): 5–29. https://infoteka.bg.ac.rs/ojs/index. php/Infoteka/article/view/2019.19.1.1_en Salloum, Wael and Nizar Habash. ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...Cvetana Krstev, Ranka Stanković. "Old or New, We Repair, Adjust and Alter (Texts)" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.3
-
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others (2020)Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages ...... Bibliographical References Ahmadi, S., Arcan, M., and McCrae, J. (2018). On lex- icographical networks. In Workshop on eLexicography: Between Digital Humanities and Artificial Intelligence. Burgun, A. and Bodenreider, O. (2001). Comparing terms, concepts and semantic classes in WordNet and the Uni- ...
... Francisco. A. Authors’ affiliations 1Society for Danish Language and Literature (DSL), Copenhagen, Denmark {sn,tt}@dsl.dk 2Austrian Centre for Digital Humanities and Cultural Heritage, Austrian Academy of Sciences, Vienna, Austria tanja.wissik@oeaw.ac.at 3Istituto di Linguistica Computazionale “A. Zampolli– ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others . "A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment" in Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille, European Language Resources Association (ELRA) (2020)
-
Split-Desktop software for the analysis of fragment size distribution of blasted rock mass
Milanka Negovanović, Lazar Kričak, Stefan Milanović, Jovan Marković, Nikola Simić, Snežana Ignjatović (2023)Drobljenje stena je najvažniji pokazatelj u proceni efekata miniranja pri proizvodnom miniranju u površinskoj eksploataciji. Stepen drobljenja stena ima veliki uticaj na efikasnost daljih operacija utovara, transporta, drobljenja i mlevenja. Optimalno drobljenje stena pri proizvodnom miniranju utiče na smanjenje ukupnih troškova proizvodnje. Stoga je pouzdana procena veličine drobljenja odminirane stenske mase veoma važno pitanje, ne samo u operacijama miniranja, već i u rudarskoj proizvodnji. Za predviđanje distribucije veličine komada odminirane stenske mase postoje različiti empirijski modeli. KUZ-RAM model omogućava ...Milanka Negovanović, Lazar Kričak, Stefan Milanović, Jovan Marković, Nikola Simić, Snežana Ignjatović. "Split-Desktop software for the analysis of fragment size distribution of blasted rock mass" in 9th International Conference Mining and environmental protection, Sokobanja, Serbia, 24 – 27. May 2023, Belgrade : University of Belgrade, Faculty of Mining and Geology (2023)
-
Terminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automatic inflectional class prediction for simple adjectives and nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for ...... we applied it to a collection of 74 papers in Serbian from the journal Infotheca. 6 The size of the corpus is 6 Infotheca - Journal for Digital Humanities (http://infoteka.bg.ac.rs/index.php/en/infoteca) Proceedings of the conference Terminology and Artificial Intelligence 2015 (Granada, Spain) ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...
... Frantzi, K., Ananiadou, S., & Mima, H. (2000). Au- tomatic recognition of multi-word terms:. the C- value/NC-value method. International Journal on Digital Libraries, 3(2): 115-130. Gelbukh, A., Sidorov, G., Lavin-Villa, E., & Chanona-Hernandez, L. (2010). Automatic Term Extraction Using Log-Likelihood ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić. "Terminology Acquisition and Description Using Lexical Resources and Local Grammars" in Proceedings of the 11th Conference on Terminology and Artificial Intelligence, Granada, Spain, 2015, Granada : LexiCon (Universidad de Granada) (2015)
-
Towards Automatic Definition Extraction for Serbian
U radu su prikazani preliminarni rezultati automatske ekstrakcije kandidata za definicije rečnika iz nestrukturiranih tekstova na srpskom jeziku u cilju ubrzanja razvoja rečnika. Definicije u rečniku Srpske akademije nauka i umetnosti (SANU) korišćene su za modelovanje različitih tipova definicija (opisnih, gramatičkih, referentnih i sinonimskih) koje imaju različite sintaksičke i leksičke karakteristike. Korpus istraživanja sastoji se od 61.213 definicija imenica, koje su analizirane korišćenjem morfoloških e-rečnika i lokalnih gramatika implementiranih kao pretvarači konačnih stanja u paketu za obradu korpusa otvorenog ...... engineering for agriculture and tools and mechanization. Logic and philosophy, on the other hand were covered by two high school textbooks focusing on humanities subjects . The domain of music is represented by two music high school textbooks: History of Music and Century of Jazz. The corpus is being developed ...
... приступ издањима Факултета и радовима запослених доступним у слободном приступу. - Претрага репозиторијума доступна је на www.dr.rgf.bg.ac.rs The Digital repository of The University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' ...
... Calzolari et al., pp. 3947–3955. Stijović, Rada, Ranka Stanković. Digitalno izdanje Rečnika SANU: formalni opis mikrostrukture Rečnika SANU. [Digital edition of the SASA Dictionary: a formal description of the microstructure of the SASA Dictionary (in Cyrillic)] In: Naučni sastanak slavista u Vukove ...Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić. "Towards Automatic Definition Extraction for Serbian" in Proceedings of the XIX EURALEX Congress of the European Assocition for Lexicography: Lexicography for Inclusion (Volume 2). 7-9 September (virtual), Democritus University of Thrace (2021)
-
Using English Baits to Catch Serbian Multi-Word Terminology
In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inflection... (inflected) dictionaries for Serbian and English; 4.1. Aligned/parallel corpus The English/Serbian textual resource was derived from the journal for Digital Humanities Infotheca3 that is published biannually in Open Access. 12 issues with 84 papers were aligned at sentence level resulting in 14,710 alignment ...
... extracted MWTs 15Phrase table often contains several similar entries of the same phrase. For example, at the digital library, for digital library, because digital library and of the digital library would represent four different entries within phrase table. We observed these as one phrase, in the manner ...
... 2008/. Vitas, D., Popović, L., Krstev, C., Obradović, I., zetić, G. P.-L., and Stanojević, M. (2012). Srpski jezik u digital- nom dobu – The Serbian Language in the Digital Age. META-NET White Paper Series. Georg Rehm and Hans Uszkoreit (Series Editors). Springer. Available online at http://www ...Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)